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The lognormal distribution describing, e.g., exponentials of Gaussian random variables is one of 
the most common statistical distributions in physics. It can exhibit features of broad distributions 
that imply qualitative departure from the usual statistical scaling associated to narrow distributions. 
Approximate formulae are derived for the typical sums of lognormal random variables. The validity 
of these formulae is numerically checked and the physical consequences, e.g., for the current flowing 
through small tunnel junctions, are pointed out. 
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I. INTRODUCTION: PHYSICS MOTIVATION 

Most usual phenomena present a well defined aver- 
age behaviour with fluctuations around the average val- 
ues. Such fluctuations are described by narrow (or 'light- 
tailed') distributions like, e.g., Gaussian or exponential 
distributions. Conversely, for other phenomena, fluctu- 
ations themselves dictate the main features, while the 
average values become either irrelevant or even non exis- 
tent. Such fluctuations are described by broad (or 'heavy- 
tailed') distributions like, e.g., distributions with power 
law tails generating 'Levy flights'. After a long period 
in which the narrow distributions have had the quasi- 
monopoly of probability applications, it has been realized 
in the last fifteen years that broad distributions arise in 
a number of physical systems 0, 0, El • 

Macroscopic physical quantities often appear as the 
sums S n of microscopic quantities xc 



*S*n ^ ^ X% 



(1) 



where x\,X2, ■ ■ ■ ,x n are independent and identically dis- 
tributed random variables. The dependence of such sums 
S n with the number n of terms epitomizes the role of the 
broadness of probability distributions of Xi's. One intu- 
itively expects the typical sum to be given by: 

Si~n(x), (2) 

where (x) is the average value of x. The validity of eq. (0) 
is guaranteed at large n by the law of large numbers. 
However, the law of large numbers is only valid for suf- 
ficiently narrow distributions. Indeed, for broad distri- 
butions, the sums S n can strongly deviate from eq. J5J). 
For instance, if the distribution of the x^s has a power 
law tail (cf. Levy flights, 0), oc l/x 1+a with < a < 1 
((x) = oo), then the typical sum of n terms is not pro- 
portional to the number of terms but is given by: 



A/a 



(3) 



Physically, eq. @ (narrow distributions) and eq. © 
(Levy flights) correspond to different scaling behaviours. 
For the Levy flight case, the violation of the law of large 
numbers occurs for any n. On the other hand, for other 
broad distributions like the lognormal treated hereafter, 
there is a violation of the law of large numbers only for 
finite, yet surprisingly large, n's. 

These violations of the law of large numbers, whatever 
their extent, correspond physically to anomalous scaling 
behaviours as compared to those generated by narrow 
distributions. This applies in particular to small tun- 
nel junctions, such as the metal-insulator-metal junctions 
currently studied for spin electronics [j, |f| . It has indeed 
been shown, theoretically @ and experimentally @>I3, 
that these junctions tend to exhibit a broad distribu- 
tion of tunnel currents that generates an anomalous scal- 
ing law: the typical integrated current flowing through a 
junction is not proportional to the area of the junction. 
This is more than just a theoretical issue since this devi- 
ation from the law of large numbers is most pronounced 
0,0 f° r submicronic junction sizes relevant for spin elec- 
tronics applications. 

A similar issue is topical for the future development of 
metal oxide semiconductor field effect transistors (MOS- 
FETs). Indeed, the downsizing of MOSFETs requires a 
reduction of the thickness of the gate oxide layer. This 
implies that tunnelling through the gate becomes non 
negligible[T3[ll, generating an unwanted current leak- 
age. Moreover, as in metal-insulator-metal junctions, the 
large fluctuations of tunnel currents may give rise to seri- 
ous irreproducibility issues. Our model permits a statis- 
tical description of tunnelling through non ideal barriers 
applying equally to metal-insulator-metal junctions and 
to MOSFET current leakages. Thus, anomalous scaling 
effects are expected to arise also in MOSFETs. 

The current fluctuations in tunnel junctions are well 
described by a lognormal probability density 0, 0] 



f(x) = LN( f i,o- 2 )(x) = 
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depending on two parameters, /x and a . The lognor- 
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mal distribution presents at the same time features of a 
narrow distribution, like the finiteness of all moments, 
and features of a broad distribution, like a tail that can 
extend over several decades. It is actually one of the 
most common statistical distributions and appears fre- 
quently, for instance, in biology 0] and finance 0] 
(for review see 0, In physics, it is often found 

in transport through disordered systems such as wave 
propagation in random media (radar scattering, mobile 
phones.. ..1 (l7l ITsf. A specially relevant example of the 
latter is transport through ID disordered insulating wires 
for which the distribution of elementary resistances has 
been shown to be lognormal |19|. This wire problem of 
random resistances in series is equivalent to the tunnel 
junction problem of random conductances in parallel |20j |. 
Thus, our results, initially motivated by sums of lognor- 
mal conductances in tunnel junctions, are also relevant 
for sums of lognormal resistances in wires. 

In this paper, our aim is to obtain analytical expres- 
sions for the dependence on the number n of terms of 
the typical sums S l n of identically distributed lognormal 
random variables. The theory must treat the n and a 1 
ranges relevant for applications. For tunnel junctions, 
both small n ~ 1 corresponding to nanometric sized junc- 
tions 01 an d large n ~ 10 13 corresponding to millimet- 
ric sized junctions , a nd both small a 2 ~ 0.1 and large 
a 2 ~ 10 EHHUll,!!!] have been studied experimentally. 
For electromagnetic propagation in random media, a 2 is 
typically in the range 2 to 10 [lgj . 

There exist recent mathematical studies on sums of 
lognormal random variables |3 E3 that are motivated 
by glass physics (Random Energy Model). However, 
these studies apply to regimes of large n and/or large 
a 2 that do not correspond to those relevant for our prob- 
lems. Our work concentrates on the deviation of the typ- 
ical sum of a moderate number of lognormal terms with 
a 2 < 15 from the asymptotic behaviour dictated by the 
law of large numbers. Thus, this paper and treat 
complementary (n, cr 2 ) ranges. 

Section [H] is a short review of the basic properties of 
lognormal distributions, insisting on their broad char- 
acter. Section IIIII presents qualitatively the sums of n 
lognormal random variables. Section llVI introduces the 
strategies used to estimate the typical sum 5 1 * . SectionlVl 
the core of this work, derives approximate analytical ex- 
pressions of fi* for different <r 2 -ranges. Section IVII dis- 
cusses the range of validity of the obtained results. Sec- 
tion IVIII presents the striking scaling behaviour of the 
sample mean inverse. Section I V 1111 contains a summariz- 
ing table and an overview of main results. 

As the paper is written primarily for practitioners 
of quantum tunnelling, it reintroduces in simple terms 
the needed statistical notions about broad distributions. 
However, most of the paper is not specific to quantum 
tunnelling and its results may be applied to any problem 
with sums of lognormal random variables. The adequacy 
of the presented theory to describe experiments on tunnel 
junctions is presented in 



II. THE LOGNORMAL DISTRIBUTION: 
SIMPLE PROPERTIES AND NARROW VS 
BROAD CHARACTER 

In this section, we present simple properties (genesis, 
characteristics, broad character) of the lognormal distri- 
bution that will be used in the next sections. 

Among man y m echanisms that generate lognormal dis- 
tributions |15l 1 1 6| , two of them are especially important 
in physics. In the first generation mechanism, we con- 
sider x as exponentially dependent on a Gaussian random 



variable y with mean \i y and variance a 

x = xoe v ^ v ° 



2. 



(5) 



where xo and j/o are scale parameters for x and y, respec- 
tively. The probability density of y is: 



i 



2na 2 y 



: exp 
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(6) 



The probability density of x, f(x) = N(/j,y,o- 2 )(y)dy/dx 
is a lognormal density LN (u, a 2 ) (x), as in eq. (0J, with 
parameters: 



u = h inxo, 



(<7y/yoY 



(7a) 



(7b) 



A typical example of such a generation mechanism is pro- 
vided by tunnel junctions. Indeed, the exponential cur- 
rent dependence on the potential barrier parameters op- 
erates as a kind of 'fluctuation amplifier' by non-linearly 
transforming small Gaussian fluctuations of the param- 
eters into qualitatively large current fluctuations. This 
implies, as seen above, lognormal distribution of tunnel 
currents 

In the second generation mechanism, we consider the 
product x n = n™=i Vi °f n identically distributed ran- 
dom variables y\ , • • • , y n . If u' and a' are the mean and 
the standard deviation of lny^, not necessarily Gaussians, 
then 



In x T , 



i=l 



tends, at large n, to a Gaussian random variable of mean 
nu' and variance nu' 2 , according to the central limit the- 
orem. Hence, using eqs. (|7a|l and l|7b|l with x = y = 1, 
x n is lognormally distributed with parameters \l = tip! 
and a 2 = na' 2 . For a better approximation at finite n, 
see [26|. 

The lognormal distribution given by eq. I@J has the 
following characteristics. 

The two parameters [i and a 2 are, according to eq. I|7a|) 
and eq. (|7b|) with xq = yo = 1, the mean and the variance 
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FIG. 1: Examples of lognormal distributions LN (/x, <r 2 ) (x) 
with u — and a = 0.1, 1 and 1.5. When a increases, the typ- 
ical values x t , indicated by the dotted lines, and the means (x) 
move rapidly away from the constant median x m , indicated 
by the broken line, in opposite directions. 



of the Gaussian random variable lnx. The parameter fi 
is a scale parameter. Indeed, if x is distributed according 
to LN(/x, a 2 )(x), then x' = ax is distributed according 
to LN(/i' = + lna, a' 2 = <r 2 )(x'), as can be seen from 
eq. (J5J, eq. (j7a|) and eq. (|7b|) . Thus, one can always take 
/i = using a suitable choice of units. On the other hand, 
a 2 is the shape parameter of the lognormal distribution. 

The typical value x t , corresponding to the maximum 
of the distribution, is 



(9) 



The median, x m , such that /(x)dx = f(x)dx = 
1/2, is 

x m = e*». (10) 
The average, (x), and the variance, var (x) = (x 2 ) — (x) 2 , 



are 



(x) = e^ +a ' 2 ' 2 , 



var (x) = e 



K-0 



(11) 

(12) 



The coefficient of variation, C = yj var (x) / (x) , which 
characterizes the relative dispersion of the distribution, 
is thus 



C = y/e°* - l. 



(13) 



Note that /i does not appear in C, as expected for a scale 
parameter. 

Figure ^ shows examples of lognormal distributions 
with scale parameter fj, = and different shape param- 
eters. For small a 2 , the lognormal distribution is nar- 
row (rapidly decaying tail) and can be approximated 



by a Gaussian distribution (see Appendix ^J. When 
a 2 increases, the lognormal distribution rapidly becomes 
broad (tail extending to values much larger than the typ- 
ical value). In particular, the typical value x* and the 
mean (x) move in opposite directions away from the me- 
dian x m which is 1 for all a 2 . The strong er 2 -dependence 
of the broadness is quantitatively given by the coefficient 
of variation, eq. l|13fl . 

Another way of characterizing the broadness of a dis- 
tribution, is to define an interval containing a certain 
percentage of the probability. For the Gaussian distri- 
bution N (/z, cr 2 ) , 68% of the probability is contained in 
the interval \p — a, /x + a] whereas for the lognormal dis- 
tribution LN(/x, tr 2 ), the same probability is contained 
within [x m /e IT > x m x e a ]. The extension of this interval 
depends linearly on a for the Gaussian and exponentially 
for the lognormal. 

Moreover, the weighted distribution xf (x), giving the 
distribution of the contribution to the mean, is peaked 
on the median x m . In the vicinity of x m one has[ 



\j2no~ 2 i 



for e"-^ <x<e" +V2 ~ CT 



(14) 



Thus, / (x) behaves as a distribution that is extremely 
broad (1/x is not even normalizable) in an x- interval 
whose size increases exponentially fast with a and that 
is smoothly truncated outside this interval. 

Three different regimes of broadness can be defined 
using the peculiar dependence of the probability peak 
height / (x*) on a 2 . Indeed, the use of eq. J3J and eq. @ 
yields: 



/(**) 



2ne^a 



(15) 



For cr 2 « 1, one has /(x') oc 1/cr and thus /(x*) cx 
1/y/va.i (x) as yj var (x) cx a (see eq. iJH}). This in- 
verse proportionality between peak height f{x t ) and peak 
width yj var (x) is the usual behaviour for a narrow dis- 
tribution that concentrates most of the probability into 
the peak. 

When the shape parameter a 2 increases, still keep- 
ing a 2 < 1, f{x t ) is no longer inversely proportional to 
yj var (x) , however it still decreases, as expected for a dis- 
tribution that becomes broader and thus less peaked (see, 
in figure Gl the difference between a — 0.1 and a = 1). 

On the contrary, when a 2 > 1, the peak height in- 
creases with a 2 even though the distribution becomes 
broader (see, in figure ^ t ne difference between a = 1 
and a = 1.5). This is more unusual. The behaviour 
of the peak can be understood from the genesis of 
the lognormal variable x = e v with y distributed as 
N (/^ = /i, cr 2 = ct 2 ) (y). When a 2 becomes larger, the 
probability to draw y values much smaller than /i in- 
creases, yielding many x values much smaller than e M , 
all packed close to 0. This creates a narrow and high 
peak for fix). 
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This non monotonous variation of the probability peak 
f(x t ) with the shape parameter a 2 with a minimum in 
a 2 = 1, incites to consider three qualitative classes of 
lognormal distributions, that will be used in the next sec- 
tions. The class a 2 -C 1 corresponds to the narrow log- 
normal distributions that are approximately Gaussian. 
The class a 2 < 1 contains the moderately broad log- 
normal distributions that may deviate significantly from 
Gaussians, yet retaining some features of narrow distri- 
butions. The class a 2 ^> 1 contains the very broad log- 
normal distributions. 



III. QUALITATIVE BEHAVIOUR OF THE 
TYPICAL SUM OF LOGNORMAL RANDOM 
VARIABLES 

In this section we explain the qualitative behaviour of 
the typical sum of lognormal random variables by relating 
it to the behaviours of narrow and broad distributions. 

Consider first a narrow distribution fjq(x) presenting 
a well defined narrow peak concentrating most of the 
probability in the vicinity of the mean (x) and with light 
tails decaying sufficiently rapidly away from the peak (fig- 
ure |2K)- Draw, for example, three random numbers X\, 
xi and £3 according to the distribution /n(x). If /n(^) 
is sufficiently narrow, then x\, xi and X3 will all be ap- 
proximately equal to each other and to the mean (x) and 
thus, 



(a) narrow distributions 



S3 = Xi + X 2 + X3 ^ 3X1, 



2 or 3 



(16) 



Note that no single term xi dominates the sum 53. More 
generally, the sum of n terms will be close, even for small 
n's, to the large n expression given by the law of large 
numbers: 



Si, 



n(x) 



(17) 



Consider now a broad distribution f B {x) whose proba- 
bility spreads throughout a long tail extending over sev- 
eral decades (figure |2p; note the logarithmic x-scale) in- 
stead of being concentrated into a peak. Drawing three 
random numbers according to /b(^), it is very likely 
that one of these numbers, for example xi , will be large 
enough, compared to the other ones, to dominate the 
sum S3: 



S3 = xi + x 2 + X3 ~ max(a;i, £2,2:3) 
More generally, the largest term M„, 

M n ee max(xi, . . . , x n ), 
will dominate the sum of n terms: 

S n ^ M n . 



x 2 - 



(18) 
(19) 
(20) 



Under these premises, what is the order of magnitude 
of S n ? To approximately estimate it, one can divide the 




(b) broad distributions 




FIG. 2: Narrow vs broad distributions, (a) A narrow distri- 
bution /n(^c) presents a well defined peak and light tails. In a 
set {xi, . . . , x n } of n random numbers drawn from /n(x), no 
number is dominant, (b) A broad distribution /b(x) presents 
a long tail extending over several decades (note the logarith- 
mic a;-scale). In a set {xi,...,x n } of n random numbers 
drawn from fs{x), one number is clearly dominant. 



interval [0; 00) of possible values of x [37j into n intervals 
[01 = 0;a 2 ), [a 2 ;a 3 ), [a„;a n+ i = 00) corresponding 
to a probability of l/n: 



1 

n 



a j+1 



f B (x) dx . 



(21) 



Intuitively, there is typically one random number Xi in 
each interval [a 3 ;; a,j+\). The largest number M n is thus 
very likely to lie in the rightmost interval [a„;oo). The 
most probable number in this interval is a n (we assume 
that /b(^) is decreasing at large x). Thus, applying 
eq. H2Ufl the sum S n is approximately given by: 



S n 



with — = 

n 



f B {x) dx. 



(22) 



As a specific application, consider for example a Pareto 
distribution fp(x) with infinite mean, 



axQ 



for x > xq and with < a < 1. 

(23) 
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In this case, the sum S n is called a 'Levy flight'. The 

XqU 1 / and thus, using 



relation 
eq. J20J, 



yields H3 M, 



(24) 



Note that, as a < 1, the average value is infinite and thus 
the law of large numbers does not apply here. 

The fact that the sum S n of n terms increases typi- 
cally faster in eq. (|24[l than the number n of terms is in 
contrast with the law of large numbers. This 'anomalous' 
behaviour can be intuitively explained (see also figureC3in 
Q for a complementary approach) . Each draw of a new 
random number from a broad distribution /b(^) gives 
the opportunity to obtain a large number, very far in the 
tail, that will dominate the sum S n and will push it to- 
wards significantly larger values. Conversely, for narrow 
distributions /n(^), the typical largest term M* increases 
very slowly with the number of terms {e.g., as Vmn for 
a Gaussian distribution and as Inn for an exponential 
distribution; see, e.g., 28]), whilst the typical sum S* 
increases linearly with n and thus ^> M^. 

The question that arises now is whether the sum of 
lognormal random variables behaves like a narrow or 
like a broad distribution. On one hand, the lognormal 
distribution has finite moments, like a narrow distribu- 
tion. Therefore, the law of large numbers must apply 
at least for an asymptotically large number of terms: 
S n — ► n(x). On the other hand, if ct 2 is sufficiently 

n — >oo 

large, the lognormal tail extends over several decades, as 
for a broad distribution (see Section |nj. Therefore, the 
sum of n terms is expected to be dominated by a small 
number of terms, if n is not too large|39j. 

The domination of the sum by the largest terms can be 
quantitatively estimated by computing the relative con- 
tribution p q to the mean by the proportion q of statistical 
samples with values larger than some x g j.40j 41] 



x' f(x')dx' I (x) 



f(x')dx'. 



(25) 



(26) 



Figure shows a plot of p q vs q for various ct's. Note 
that the curve (1 — q, 1 — p q ) is called a Lorenz plot in the 
economics community when studying the distribution of 
incomes (see, e.g., 29}). For small ct's (ct < 0.25), one 
has p q ~ q for all q: all terms Xi equally contribute to the 
sum S n . This is the usual behaviour of a narrow distri- 
bution. For larger ct's, one has p q ^> q for q -C 1: only a 
small number of terms contribute significantly to the sum 
S„ . This is the usual behaviour of a broad distribution. 
Monte Carlo simulations of tunnelling through MOSFET 
gates yield p q vs q curves that are strikingly similar to 
figure Ejb (see figure 11 of [Hj). Indeed, the parameters 
used in [lOj correspond to a barrier thickness standard 
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FIG. 3: Heterogeneity of the terms of lognormal sums, a) 
Average proportion p q of the average physical quantity (x) 
carried by the average proportion q of the statistical sam- 
ple. For narrow lognormal distributions (a 2 <C l) , all terms 
equally contribute to the sums (p q ~ q). For broad lognor- 
mal distributions, a small proportion of the terms provide the 
major contribution to the sums (p q q for q <JC 1). b) Gini 
coefficient giving a quantitative measure of the heterogeneity. 



deviation of ad = 0.18 nm, a barrier penetration length 
A ~ 7.8 x 10~ 2 nm which gives ct = (r d /\ ~ 2.3 (see or 
for the derivation of a = a^jX). For this ct, figure 11 
of E3 fits our p q vs q without any adjustable parameter. 

As in economics, the information contained in figureyp, 
can be summarized by the Gini coefficient G represented 
in figure [^Jd: 



G = 2 ( Pq -q)dq, 



(27) 



giving a quantitative measure of the heterogeneity of 
the contribution of the terms to the sum. In the 
lognormal case this expression becomes: G (ct) = 



6 



1 - 2 N (0, 1) 0) $ (u - a) du, where N(0, l)(u) 
is the normal distribution (eq. 10) and <!> (u) = 
f_ N (0, 1) (u r ) du' the corresponding distribution func- 
tion. The solid line in figure |3b represents G (a) for var- 
ious cr's. As expected, G (a) varies from when a = 0, 
which means that all terms of a narrow lognormal distri- 
bution equally contribute to the sums, to 1 when a — > oo, 
which means that only a small proportion of the terms of 
a broad lognormal distribution contributes significantly 
to the sums. The broken lines in figure |3Jd represent an- 
alytically derived asymptotic approximations of G (a) : 

a < 1 : G (a) ~ (28a) 



2e~ CT2 / 4 

<7»1:G(<t)~1 -= — (28b) 

(Our derivations of these formulae, which are not ex- 
plicitely shown here, are based on usual expansion tech- 
niques). 

In summary, if a 2 is small, the sum of n lognormal 
terms is expected to behave like sums of narrowly dis- 
tributed random variables, for any n. Conversely, if a 2 
is sufficiently large, the sum of n lognormal terms is ex- 
pected to behave, at small n, like sums of broadly dis- 
tributed random variables and, at large n, like sums of 
narrowly distributed random variables (law of large num- 
bers). Before converging to the law of large numbers 
asymptotics, the typical sum may deviate strongly from 
this law. Moreover, if this convergence is slow enough, 
physically relevant problems may lie in the non converged 
regime. This is indeed the case of submicronic tunnel 
junctions Q. 

IV. STRATEGIES FOR ESTIMATING THE 
TYPICAL SUM 

In this section, we discuss strategies for obtaining the 
typical sum S l n of lognormal random variables depending 
on the value of the shape parameter a 2 . 

By definition S 1 * is the peak position of the distribu- 
tion of S n . Moreover, the latter is the n-fold convolu- 
tion of f(x) and is denoted as f n *(S n ). As no exact 
analytical expression is known for /"* when / is lognor- 
mal, one will turn to approximation strategies. These 
strategies can be derived from the schematic represen- 
tation, in the space of distributions, of the trajectory 
followed by /"* with increasing n (figure The set of 
lognormal distributions can be represented by an open 
half-plane (/i, a 2 ) with ji G (— oo;oo) and a 2 £ (0;oo). 
In this half-plane, the shaded region with a 2 <C 1 cor- 
responds to quasi-Gaussian lognormal distributions (see 
Appendix 1X1) . The whole lognormal half- plane is embed- 
ded in the infinite dimension space of probability distri- 
butions, which is schematically represented in figure 0] as 
a three dimension space. 




FIG. 4: Schematic representation of the trajectory of f n * 
in the space of distributions. The set of lognormal distri- 
butions corresponds to the open half-plane (u £ R, a 2 > 0) . 
The infinite dimension space of probability distributions is 
schematically represented here in three dimensions. In the 
region ((iGl,(r 2 <l) (shaded area), lognormal distribu- 
tions are quasi-Gaussian. (a) Narrow lognormal distributions 
/ 1 *(<t 2 <C 1) are quasi-Gaussian and, thus, the trajectory 
of f n * starts and ends up in the close vicinity of the line 
[fi £ R, a 2 = 0) . (b) For moderately broad lognormal distri- 
butions (a 2 < 1), the trajectory of f n * starts in the lognormal 
half-plane, not too far away from the quasi-Gaussian region, 
that is reached for asymptotically large n. Thus /"* is con- 
jectured to lie, for any n, close to the lognormal half-plane: 
f n * ~ LN(/i„,trJ). (c) Very broad lognormal distributions 
f 1 *(a 2 1) lie far away from the quasi-Gaussian region. 
Thus, there is a long way before /"* enters the quasi-Gaussian 
region and /"* has the possibility to come significantly out of 
the lognormal half-plane for intermediate values of n. 



The starting point f 1 * and the asymptotic behaviour 
/"* with n — > oo of the /"* trajectory are trivially known 
for any a 2 . Indeed, f 1 * = f = LN(/i, a 2 ) lies exactly in 
the lognormal half-plane. Moreover, the finiteness of the 
moments of the lognormal distribution f 1 * = f implies 
the applicability of the central limit theorem: 

f n *(S n ) - N(n(x),na 2 )(S n ), (29) 

n — >oo 

where N {n(x}, ncr 2 ) (S n ) is narrow since its coefficient of 

variation V na 2 /n(x) oc \j\fn tends to zero. As narrow 
Gaussian distributions are quasi-lognormal, as shown in 
Appendix [XJ /"* lies close to the quasi-Gaussian region 
of the lognormal half-plane. 

For intermediate n, on the contrary, the trajectory of 
/"* strongly depends on the broadness of the initial log- 
normal distribution f 1 * and three different cases can be 
distinguished. 

For narrow lognormal distributions (a 2 <§; 1), both 
the starting point f 1 * and the end point /"* for n = oo 
belong to the quasi-Gaussian region. Therefore one can 
assume that f n * is quasi-Gaussian for any n, which gives 
immediately the typical sums (see Section IY All . 

For moderately broad lognormal distributions (a 2 < 
1), /"* does not start too far away from the quasi- 
Gaussian region that is reached at large n. Hence, one 
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can assume that /"* remains close to the lognormal half- 
plane[42| in between n = 1 and n = oo. Thus, the ap- 
proximation strategy will consist in finding a lognormal 
distribution LN(/z„, a 2 ) (see broken line in figure^ that 
closely approximates f n * (see Section IV B|l . 

For very broad lognormal distributions (a 2 3> 1), / n * 
starts far away from the quasi-Gaussian region that is 
reached at large n. Hence, /"* may significantly come 
out of the lognormal half-plane. In this case, the approx- 
imation strategy is dictated by the fact that sums S n are 
dominated by the largest terms (see Section IV CI) . 



V. DERIVATION OF THE TYPICAL SUMS OF 
LOGNORMAL RANDOM VARIABLES 

In this section we apply the strategies discussed above 
in order to derive approximate analytical expressions of 
5* for different ranges of a 2 . 



A. Case of narrow lognormal distributions 

We consider here the case a 2 <C 1 of narrow lognormal 
distributions. As seen in Appendix lAl a narrow lognor- 
mal distribution is well approximated by a normal distri- 
bution: 



a 2 < 1 : LN U,a 2 ) 



N 



(f, (ae»f 



Consequently, the typical sum S 1 * is simply given by: 



a 2 < 1 : Si 



(30) 



as in the Gaussian case, for any number of terms. Note 
that the law of large numbers asymptotics S n — + n(x) — 

ne^ +(J / 2 , close to cq. <|3(J|) for a 2 -C 1, is applicable here 
even for a small number of terms. 



B. Case of moderately broad lognormal 
distributions 



We consider here the case a 2 < 1 of moderately broad 
lognormal distributions that already allows considerable 
deviation from the Gaussian behaviour obtained for a 2 <C 
1 (see Section IVA) . The distribution /"* of S n is now 
conjectured to be close to a lognormal distribution: 



a 2 < 1 : f n * (S n ) ~ LN U 



>°"n) ( S n) 



(31) 



Two equations characterizing f n * are needed to deter- 
mine the two unknown parameters /i„ and cr 2 . 

The cumulants provide such exact relationships on 
/"*. In particular, the first two cumulants 43], (S n ) and 
var(SVi) obey: 



(S n ) = n(x) 



(32a) 



var (S n ) — nvar (x) 
These equations imply 
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n 



(32b) 



(33) 



1 /2 

where C„ = [var (S n )\ 1 / (S n ) is the coefficient of vari- 

m 



ation of S n 



As / is lognormal and /"* is approxi- 



mately lognormal, one has C 



1 and Cl 



■1 



(see eq. 1)130). Then, using eq. (J2HJ), we obtain 

l" 



In 1 



In 1 



9l 
n 



(34) 



At last, we derive fj, n by developing eq. 132a|) using 
cq. (JTlJ: 



(35) 



Thus, thanks to eq. one has: 




1 / C 2 
ln(n(x)) In H 

2 \ n 



(36) 



In the remainder of this section, we will examine the 
consequences of eq. I|34f) and eq. (|3()|) on the typical sum 
S n , on the height of the peak of f n * and on the conver- 
gence of /"* to a Gaussian. 

The typical sum S n derives from eq. l|3*l|l , eq. (|3*rJ)l and 
eq. ©: 



< 1 : S* 



1 



'1 



-) 



3/2 ' 



(37) 



The typical sum S n appears as the product of the usual 
law of large numbers, n(x), and of a 'correction' factor, 
(1 + C 2 /n)~ 3 / 2 , which can be very large. The square of 
the coefficient of variation defines a scale for n: when 
n ^> C 2 , the law of large numbers approximately holds, 
whereas when n <C C 2 , the law of large numbers grossly 
overestimates S n . If the initial lognormal distributions 
is broader, C 2 is larger and, thus, larger n's are required 
for the law of large numbers to apply. We analyze now 
more precisely the small n and large n behaviours. 

For n = 1, eq. l|3T)l gives S\ ~ e^~ a which is, as it 
should be, the exact expression for the typical value x t 
of a single lognormal term (see eq. |J3J). For small n, we 
obtain 



n< C 2 



,5/2 



(X) 

C 3 ' 



(38) 
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i.e., a much faster dependence on n then in the usual law 
of large numbers; this evokes a Levy flight with expo- 
nent a = 2/5 (see eq. EU). For large n, the expression 
eq. lpT7Jl expands into |45|: 



n> C 2 



S„ 



n(x) 



(39) 



The practical consequences of these expressions appear 
clearly on the sample mean Y n : 



v = 

n 



(40) 



Eq. H38|) and eq. H39[) give the typical sample mean Y 7 

rx _ f_n_\ 3 /' 2 



n<^C 2 



\C 2 J 



(41a) 



1 1 MINI- 

o = o.i ; 

0=0.25 : 




o = 2 . 
I 



1000 



n>07 2 : F„* ~ (x) - - — (x). (41b) 



3C 2 
2 n 



Thus, for small systems (n <C C 2 ), one has <§; (x). In 
other words, i/ie sample mean of a small system does not 
typically yield the average value. For instance, if a 2 = 4, 
Yi ~ (x)/400. This is important, e.g., for tunnel junc- 
tio ns H and contradicts common implicit assumptions 
[3fll3l| . For large systems (n> C 2 ), one recovers the av- 
erage value. However, the correction to the average value 
decreases slowly with n, as 1/n, and might be measur- 
able even for a relatively large n|46j. Thus, macroscopic 
measurements may give access to microscopic fluctua- 
tions, which is important for physics applications. Usu- 
ally, microscopic fluctuations average out so that they 
can not easily be extracted from macroscopic measure- 
ments. This property, often taken for granted, comes 
from the fast convergence of sums S n to the law of large 
numbers asymptotics, which only occurs with narrow dis- 
tributions. 

We consider now the peak height g n (Y%) of the distri- 
bution 



g n (Y n )=nf n *(S n ) 



(42) 



of the sample mean Y n (figure |SJ. Combining eq. i|15|) 
and eq. lj3"Tl) via eq. and eq. (|37))l gives: 



l + C 2 /n 



2n(x)y/\n(l + C 2 /n) 



(43) 



A simple study, for the non trivial case a 2 > 1/2, reveals 
that g n (Xn) decreases from n = 1 to n = C 2 /(e 1//2 — 1) 
(> 1) and then increases for larger values of n. This 
echoes the non- monotonous dependence on a 2 of the peak 
height of a lognormal distribution / (see eq. ljT5|) and re- 
lated comments). The increase at large n simply corre- 
sponds to the narrowing of the distribution of Y n — S n /n 
when n increases, as predicted by the law of large num- 
bers. Moreover, the large n expansion of eq. 1) 1.'>[1 gives 



FIG. 5: Peak height g n (y£) of the distribution of the sam- 
ple mean Y n — S n /n. Initial lognormal distribution: LN(/i = 
0,cr 2 ). For narrow lognormal distributions (a 2 < 1/2), g n (Y^) 
always increases with n (normal behaviour). For broader log- 
normal distributions (a 2 > 1/2), g n (Y^) presents an unusual 
decrease with n at small n indicating that the peak of g n (Y n ) 
broadens, even if its far tail becomes lighter as usual. 



g n (Y*) ~ , ^™ , which is the prediction of the cen- 

tral limit theorem, as it should be. On the other hand, 
the decrease of g n (Y£) at small n is less usual. The peak 
of g n {Xn) is actually broader than the one of the uncon- 
volutcd distribution g\ — f . This behaviour can be un- 
derstood in the following way. If the lognormal distri- 
bution f(x) is broad enough (C 2 ^> 1), it presents at 
the same time a high and narrow peak at small x and 
a long tail at large x. The effect of convoluting / with 
itself is first (n < C 2 j(e}l 2 — 1)) to 'contaminate' the 
peak with the (heavy) tail. This results in a broadening 
and decrease of the f n *{S n ) peak which is strong enough 
to entail a decrease of the g n (Yn) = nf n *(S n ) peak. On 
the contrary, when enough convolutions have taken place 
(n > C 2 /(e 1 / 2 — 1)), the shape parameter a 2 (eq. 
becomes small and the tail of /"* becomes light. Under 
these circumstances, further convolution mainly 'mixes' 
the peak with itself. This results in a broadening and 
decrease of the f n *(S n ) peak which is weak enough to 
allow an increase of the g n (Yn) = nf n *(S n ) peak. 

The small n decrease of g n {Y n ) has physical conse- 
quences. There is a range of sample sizes, corresponding 
to n < C 2 /(e 1 / 2 — 1) for which the precise determina- 
tion of the typical values becomes more difficult when 
the sample size increases. This is a striking effect of the 
broad character of the lognormal distribution^^] • O n the 
contrary, for narrow distributions, the determination of 
the typical value becomes more accurate as the sample 
size increases. 

At last, we examine the compatibility of the obtained 
/"* with the central limit theorem by studying the distri- 
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bution h n (Z n ) = f n *(S n )dS n /dZ n of the usual rescaled 
random variable Z.„ : 



S n - n(x) 



yj n var (x) 

Simple derivations using eq. (|13[) and eq. lead to 



(44) 



h n {Z n ) 



C 



2nn 



exp 



21n(l + f) 



(45) 



For n > C 2 and \CZ n /y/n\ < 1, one has ln(l + C 2 /n) ~ 
C* 2 /ri and ln(l + CZ n /yfk) ~ CZ n /^ - (CZ n ) 2 /2n, 
which gives: 



h n {Z n ) 



exp 



Clearly, the central limit theorem is recovered 48J : 



(46) 



K{Z n ) -> — ^e _z " /2 when n -»■ oo, (47) 
v 2ir 



consistently with the strategy defined in Section ITVl fsee 
eq. Q29|lh Moreover, the square of the coefficient of vari- 
ation appears in eq. I|46[l as the convergence scale of /"* 
to the central limit theorem. As shown in eq. i|37|) . C 2 
is also the convergence scale of S n to the law of large 
numbers. 



C. Case of very broad lognormal distributions 

We consider here the case er 2 ^> 1 of very broad log- 
normal distributions. To treat this complex case, we will 
proceed through different steps, in a more heuristic way 
than in the previous cases. 

The first step is to assume that the sums S n are typi- 
cally dominated by the largest term M n , if n is not too 
large (see eq. (|2UJ) and Section EH 49]. Thus, the dis- 
tribution function of S n , defined as the probability that 
S n < x and denoted as Pr(S f „ < x), is approximately 
equal to the distribution function of M n , denoted as 
Pr(M„ < x): 



a 2 > 1 : Pr (Sn < x) ~ Pr (M n < x) 



(48) 



As M n is the largest term of all Xi's, M n < x is equivalent 
to Xi < x for all i — 1, . . . , n. Thus, 

Pr (M n < x) = Pr (xi < x) x • ■ ■ x Pr (x n < x) = [F (x)] n 

(49) 



where F (x) = J /(x')dx' is the distribution function of 
the initial lognormal distribution. This implies 



Pr(S„<x)~[F(x)]" 



(50) 



|50| . By definition, the typical sum S n is given by 
d 2 Pr(S„ < x)/dx 2 — 0, which, from eq. lfoT7|) . leads to: 



{o + y n ) V27r$ (y n ) + (n - 1) e 







(51) 



where y n 

-1/2 PV 



= a (In S* and $ (y) = 

(2vr) 1/z e~" / 2 du is the distribution function 
of the standard normal distribution N (0, 1). This 
equation has no exact explicit solution. However, as 
yi = — <t <C —1 (use eq. © with S\ = x*), let us assume 
that y n <C — 1 also for n > 1. Then we can approximate 
$(y„) by $(y„) ~ - e -»»/ 2 /v^„ _ (see, e.<?, 0, 
chap. 26). This leads to a linear equation on y n} giving 
y n ~ —cr/n, valid for y„ -C — 1, i.e., n < er. Finally, one 
has: 



a 2 > l,n < a : Si 



-a 2 In 



(52) 



For n — 1 this expression is exact. When n increases 
till Ti = <j 2 , eq. H52[) gives an unusually fast, exponential 
dependence on n that is in contrast with, e.g., the n 5 ^ 2 
dependence obtained for a 2 < 1 and n -C C 2 (eq. J38J)). 
Unfortunately, when n becomes larger, eq. (|52|l is qual- 
itatively wrong. Indeed, it implies S n /n — > e^/n — > 
instead of S^/n — * (x) as predicted by the law of large 
numbers. 

The second step, improving eq. I|52|l . consists in com- 
bining eq. (|52|l with a cumulant constraint. We assume 
that /"* ~ LN al) as in Section EH for all n = 2 j , 
with j — 1,2,... and that the typical sum SL+i is 
e M 2 j - a 2 i I 2 as in eq. I|52|l since S 2 j is considered as lognor- 



mal. We use these assumptions and (S 2 j+i) = 
determine induction relations between (fi^+x 
(/i 2 3 , c 2 j ) , which leads to: 



2 i+i 



'2J + 1 



\x) to 
) and 



M2i = M 



-ln2 



1 - 



In 2, 



(53a) 



1 - 



jln2. (53b) 



The typical sum is then 



S n ~ n(x) exp 



2n ln(3/2)/ln2 



3 In 2 1 



1 



n ln(3/2)/ln2 

(54) 

Eq. (|54|l is still exact for n = 1 and it clearly improves 
on eq. (|52ll for large u. Indeed, when n — * oo, 5*/rt no 
longer tends to 0. However, S n /n tends to (x)/8 instead 
of (x) , which is the signature of a leftover problem. This 



comes from the assumptions that f n * 



LN ( Mr , 
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which may be correct for large n (small ofj but is exces- 
sive for small n (large a 2 ,), and that S^ j+1 — e^ 2J ~ a v / 2 , 
which is correct for small n — V (large cr 2 ) but is exces- 
sive for large n (small a n ). 

The third step, in order to cure the main problem of 
eq. 1)54(1 . is to wildly get rid of the last term in the ex- 
ponential which prevents from converging to n(x) at 
large n, which does not affect the validity for n — 1: 



> 1 



S n 



n(x) exp 



2n ln(3/2)/ln2 



(55) 



We have tried to empirically improve this formula by 
looking for a better exponent a than In (3/2) /In 2 for 
a 2 G [0.25, 16]. Unfortunately, no single a value is ade- 
quate for all cr's. Eq. (|55|l with a = In (3/2) / In 2 stands 
up as a good compromise for the investigated u-range. 
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VI. RANGE OF VALIDITY OF FORMULAE 



FIG. 6: Distributions g n (Y n ) of the sample mean Y n for an 
initial lognormal with fi — and a — 1.5. 



In this section we proceed to the numerical determi- 
nation of the range of validity of the three theoretical 
formulae given by eq. QPt , e q. EH an d eq. ifSlfl) for the 
typical sum of n lognormal terms. 

In order to fulfil this task, the typical sample mean 
Y n ( e q- 601) instead of 5* will be used. This has the 
advantage of showing only the discrepancies to the mean 
value without the obvious proportionality of on n re- 
sulting from the law of large numbers. The values of 
computed using the three theoretical formulae eq. (|30ll . 
eq. and eq. (£>5J) are called Y* v Y* n and Y* 
spectively: 



< 1 



yt _ 

1 n,II — 



yt 
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n 



-3/2 



> 1 = Yn,m = (x) exp 



2 n ln(3/2)/ln2 



(56a) 



(56b) 



(56c) 



The exact typical sample mean, derived from Monte 
Carlo generation [5l| of the distributions g n (Y n ), is called 
Y 7 \ cx . Enough Monte-Carlo draws ensure negligible sta- 
tistical uncertainty. As an example, we show in figure El 
the obtained distributions g n (Y n ) for /j, = and a = 1.5. 
Notice that Y* ex moves from Yf cx = x l = e~ a ~ 0.11 



to F* 



= IX) = e 



= 



3.08. To determine the 



Y£ ex 's, shown as solid line in figure [7| the absolute max- 
imum of g n (Y n ) is obtained by parabolic least square 
fits performed on the log / log representation of each 
distribution Moreover, in the latter figure, we also 
show y^j (dots), (circles) and Y£ UI (squares). 

To determine the validity range of the theoretical for- 
mulae, we define two error estimators. The first one is 
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FIG. 7: Y niCX (solid line) for an initial a = 1.5 = 0) as 
well as Yn,\ (c 2 <§C 1, dots), Y* tU (a 2 < 1, circles) and Y* m 
(a 2 3> 1, squares). 



the maximum relative error £ r el,(i, n. or in) , i-e., the 
maximum deviation referred to the minimum between 
Y* ^ n or j j j -j and 3^ j<nc , which is defined as follows: 

/ yt yt yt yt \ 

e / n,i n,cx n.cx n,i -in l 

o T el,i = max — , — t ; n = 1, 2, ... I 

V n.cx n.i / 



and can be transformed into: 

5 Ie \ ; i = max el ( v <Cx) I — 1; n = 1, 2, . 

The second one is the maximum scale error 
<5scaie.(i.n or in) > '- e -j the maximum deviation in 
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FIG. 8: Maximum relative errors <5 rB i ; as functions of a. 



FIG. 9: Maximum scale errors 5 Bca \ c j as function of a. 



magnitude referred to the total amplitude of the 
phenomenon: 



^scaic.i = max 



m(Y t /Y t ) 



m (^ODjOx/^l'ex) 



in = 1,2, 



Using eq. © for Y^ ex and eq. JT7J) for i^.ex. We,; 
down to: 

<5 S caio,i = max 



* ) 



3a 2 



;n = l,2,. 



(59) 



boils 



(60) 



Remark that J rc i,j = exp ^^§-<5 S caic.i) — 1. The first step 
for computing <5 rc u an d <5 S caic.i is thus to find the value of 
n for which lln (Y£ i/Yn ex) I i s maximum. For the data 
shown in figure Q we find n = 1 for eq. H56a|) . n = 4 for 
eq. (I56b|l and n — 4 for eq. H56c|l , which gives S IC \ i = 
849% (<5 sca ie,i = 67%), 5 rel ,n = 61% (5 sca i e ,„ = 14%) and 

<5rel,III = 31% (^scaloJII = 8%). 

To work out the dependences of <5 rc i ^ (figure [SJ and 
^scaie.i (figure |5J as functions of er, the same kind of cal- 
culation is performed for a £ (0, 4] which is the relevant 
range for the chosen physics applications. The dot- 
ted lines representing <5 rc i.i and ^ S caie,i show that the first 
theoretical formula is the least accurate in the explored 
a range. However, for its domain of application, tr 2 <C 1, 
the error is acceptable for <5 re i,i (<5 rc u — er 2 , see [5^ )■ 
Indeed, (5 re i,i < 7% for a £ [0, 0.25] which, in turn, means 
that lognormal distributions are quasi-Gaussian in this 
range (see shaded area in figure^J. The solid lines repre- 
senting 5 rc \ t u and 5 sca io,n show that the second theoretical 
formula is the most accurate in the range < a < 1.25 
giving <5 re i,n < 30% and <5 SC aic,ii ^ 10%. Note that good 
tunnel junctions fall within this a range. The broken 
lines representing <5 ro i.iii and ^scaio.in show that the third 
theoretical formula is the most accurate for a > 1.25 and 
is reasonably accurate for a < 1.25. Note that, for a = 4, 



the maximum relative error <5 r ei,ni — 400% appears quite 
high. However, when the error is referred to the total 
amplitude of the scaling, as given by 5 sca ie in, it is only 
7%. 

Importantly, the observed ranges of validity of the 
three different formulae are consistent with the strategies 
of approximation used to derive these formulae. This 
provides an a posteriori confirmation of the theoretical 
analysis presented in the paper. 



VII. A STRIKING EFFECT: SCALING OF THE 
SAMPLE MEAN AND OF ITS INVERSE 

In general, if a function is increasing, its inverse is de- 
creasing. What happens if one considers the typical val- 
ues of a random variable and of its inverse ? Does one 
have : 



(V*0 



(61) 



While this is intuitively true for narrow distributions, it 
may fail for broad distributions. 

This problem arises in electronics, where it is custom- 
ary to study the product RxAof the device resistance R 
by the device size A. One usually checks that R x A does 
not depend on A, otherwise this dependence is taken as 
the indication of edge effects. The resistance R being the 
inverse of the conductance can be represented by 1/S n 
where S n is the sum of n independent conductances. The 
size A of the system is proportional to n. Hence, one has: 



R x A cx 



n 

Sn 



1 

YZ 



(62) 



where Y n is the sample mean of conductances. We have 
shown that the typical Y n increases with the sample 
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TABLE I: Range of applicability of the different formulae. 
Errors are measured by <5 le i and <5 SC aie, see Section fVTl for de- 
tails. 





a range <5 rc i 


^scale 




[0,0.25] < 7% 


< 67% 


n(x) ( 1+ 4) a/2 


[0, 1.25] < 30% 


< 10% 


n(a;)exp[ ^ nln(3 ; 2)/ln2 ] 


[1.25, 4] < 400% 


< 7% 



size (see eqs. (I56H 1 ). if conductances are lognormally dis- 
tributed. Hence, Rx A being proportional to the inverse 
of Y n , one naively expects a decrease of the typical value 
of R x A with n oc A. 

What do the results presented in this paper imply for 
the typical value of R x A? Let us do the correct calcu- 
lation in the case a 2 < 1, relevant for good tunnel junc- 
tions. As /"* (S n ) ~ LN (fi n> o-l) (S n ), the distribution 
of 1/Y n is: 



LN(-M„ + lnn,a£) (1/F„) 



(63) 



(see Section[nJ. The typical sample mean inverse is thus, 
using eqs. iffit)) and (|3l))) : 



a 2 < 1 : (\/Y n f 



1 



, 1/2 ■ 



(64) 



Thus just as Y*, (1/Y n ) increases with the sample size! 

This counterintuitive result epitomizes the paradoxical 
behaviour of some broad distributions. Moreover, this 
can be a possible explanation for the anomalous scaling 
of R x A observed for small magnetic tunnel junctions 

m. 



VIII. CONCLUSION 

We have studied the typical sums of n lognormal ran- 
dom variables. Approximate formulae have been ob- 
tained for three different regimes of the shape parameter 
a 2 . Table [I] summarizes these results with their ranges of 
applicability. These results are relevant up to a < 4; for 
larger a, one may apply the theorems in |24j | and (2FJ- 

The anomalous behaviour of the typical sums has been 
related to the broadness of lognormal distributions. For 
large enough shape parameter a 2 , the behaviour of log- 
normal sums is non trivial. It reveals properties of broad 
distributions at small sample sizes and properties of nar- 
row distributions at large sample sizes with a slow tran- 
sition between the two regimes. Counter-intuitive effects 
have been pointed out like the decrease of the peak height 
of the sample mean distribution with the sample size and 
the fact that the typical sample mean and its inverse do 
not vary with the sample size in opposite ways. Finally, 
we have shown that the statistical effects arising from 
the broadness of lognormal distributions have observable 
consequences for moderate size physical systems. 
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APPENDIX A: APPROXIMATION OF NARROW 
LOGNORMAL DISTRIBUTIONS BY NORMAL 
DISTRIBUTIONS AND VICE VERSA 

As seen in Section [H] the lognormal probability dis- 
tribution f(x) = LN(/x, o~ 2 )(x) is mostly concentrated in 
the interval [e M e _<T , e^e 17 ]. If <j -C 1, this range is small 
and can be rewritten as: 



(I- o-) <x <e^ + . 



(Al) 



Thus it makes sense to expand f(x) around its typical 
value by introducing a new random variable e defined 
by: 

x = e»(l + e), (A2) 

where e is a random variable on the order of a: 

-a < e < a. (A3) 

As a <C 1, this entails |e| <C 1. Expanding the lognormal 
distribution f(x) of eq. m powers of e leads to: 



i 



\/27rcr 2 e^ 



exp 



2a 2 + 2a 2 



The dominant term gives f(x) 
thus using eq. ljA2(l : 

f(x) ~ ; exp 



{x-e^Y 
2 (ae^f 



exp 



(A4) 

(-*)• 

(A5) 



In other words, a narrow lognormal distribution is well 
approximated by a normal distribution: 



cr< 1 : LN (fj,, a 2 ) ~ N (V\ (ere' 1 ) 2 ) 



(A6) 



More intuitively, the Gaussian approximation of nar- 
row lognormal distributions LN(/i,a 2 ) (i) can be in- 
ferred from the underlying Gaussian random variable y 
with distribution N (0, 1) (y), with x = e^ +ay . Since 
1 2/| — 1 an d < 1, one has \ay\ <C 1 and, thus, 
x ~ e M (1 + ay). Consequently, x being a linear trans- 
formation of a Gaussian random variable, is itself nor- 
mally distributed according to N (e 11 , (ce' 1 ) 2 ) , in agree- 
ment with eq. (|A6|) . 
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Conversely, a narrow (cr <C (J,) Gaussian distribution 
N (/i, cr 2 ) can be approximated by a lognormal distribu- 
tion: 

a < 1 : N (//, cr 2 ) ~ LN (in /i, (cr//i) 2 ) . (A7) 

For completeness, one can easily show that any 
Gaussian distribution N (//, cr 2 ) can be approxi- 



mated by a three parameter lognormal distribution 

LN ^ln (fi + A) , (j^r^j ,A*J where A is any number 

such that A+n ^S> cr. The probability density of the three 
parameter lognormal distribution is LN (/i, cr 2 , A) — 

^%^exp{-M£^l!} for x > A and oth- 

erwise. 
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